Sequential Change-Point Detection for High-Dimensional and Non-Euclidean Data
نویسندگان
چکیده
In many applications, it is often of practical and scientific interest to detect anomaly events in a streaming sequence high-dimensional or non-Euclidean observations. We study non-parametric framework that utilizes nearest neighbor information among the observations changes an online setting. It can be applied data arbitrary dimension as long similarity measure on sample space defined. consider new test statistics under this more effectively than existing while keeping false discovery rate controlled at fixed level. Analytic formulas approximating average run lengths approaches are derived make them fast applicable modern datasets. Simulation studies provided support theoretical results. The proposed approach illustrated with analysis NYC taxi dataset.
منابع مشابه
on the bayesian sequential change-point detection
the problems of sequential change-point have several important applications in quality control, signal processing, and failure detection in industry and finance. we discuss a bayesian approach in the context of statistical process control: at an unknown time $tau$, the process behavior changes and the distribution of the data changes from p0 to p1. two cases are considered: (i) p0 and p1 are fu...
متن کاملChange-point models and performance measures for sequential change detection
For the problem of sequential change detection we propose a novel modelling of the change-point mechanism. In particular we regard the time of change as a stopping time controlled by Nature. Nature, in order to decide when to impose the change, accesses sequentially information which can be different from the information provided to the Statistician to detect the change. Using as performance me...
متن کاملDynamic Frailty and Change Point Models for Recurrent Events Data
Abstract. We present a Bayesian analysis for recurrent events data using a nonhomogeneous mixed Poisson point process with a dynamic subject-specific frailty function and a dynamic baseline intensity func- tion. The dynamic subject-specific frailty employs a dynamic piecewise constant function with a known pre-specified grid and the baseline in- tensity uses an unknown grid for the piecewise ...
متن کاملOutlier Detection in High Dimensional, Spatial and Sequential Data Sets
Of all the data mining techniques, outlier detection seems closest to the definition of “discovering nuggets of information” in large databases. When an outlier is detected, and determined to be genuine, it can provide insights, which can radically change our understanding of the underlying process. The purpose of the research underlying this thesis was to investigate and devise methods to mine...
متن کاملStatistics for Change Detection in High–dimensional Data Streams
The method of change (or anomaly) detection in high-dimensional discrete-time processes using a multivariate Hotelling chart is presented. We use normal random projections as a method of dimensionality reduction. We indicate diagnostic properties of the Hotelling control chart applied to data projected onto a random subspace of R. We examine the random projection method using artificial noisy i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Signal Processing
سال: 2022
ISSN: ['1053-587X', '1941-0476']
DOI: https://doi.org/10.1109/tsp.2022.3205763